library(tidyverse)
library(writexl)
library(plyr)
----------------------------------------------------------------------------------------------------------------------------------
You have loaded plyr after dplyr - this is likely to cause problems.
If you need functions from both plyr and dplyr, please load plyr first, then dplyr:
library(plyr); library(dplyr)
----------------------------------------------------------------------------------------------------------------------------------

Attaching package: ‘plyr’

The following objects are masked from ‘package:dplyr’:

    arrange, count, desc, failwith, id, mutate, rename, summarise, summarize

The following object is masked from ‘package:purrr’:

    compact
library(lubridate)

Attaching package: ‘lubridate’

The following objects are masked from ‘package:base’:

    date, intersect, setdiff, union
library(plotly)
Registered S3 method overwritten by 'data.table':
  method           from
  print.data.table     
Registered S3 method overwritten by 'htmlwidgets':
  method           from         
  print.htmlwidget tools:rstudio

Attaching package: ‘plotly’

The following objects are masked from ‘package:plyr’:

    arrange, mutate, rename, summarise

The following object is masked from ‘package:ggplot2’:

    last_plot

The following object is masked from ‘package:stats’:

    filter

The following object is masked from ‘package:graphics’:

    layout

Exchange rate and sentiment: is there a connection?

Before we can start with our machine learning model we need to understand the relationship between the two variables, therefore we should calculate covariance. This measures the direction of a relationship between the two variables.

First step: creating a dataframe from the csv

btc_exchange_rate_history <- read.csv("D:/Suli/Szakdolgozat1/development_n_stuff/aggregated_data.csv") %>%
                             select(-X) %>%
                             mutate(Date = as_date(Date))
Error in `mutate()`:
! Problem while computing `Date = as_date(Date)`.
Caused by error in `as.Date.default()`:
! do not know how to convert 'x' to class “Date”
Backtrace:
  1. ... %>% mutate(Date = as_date(Date))
  8. lubridate::as_date(Date)
 10. base::as.Date.default(x, ...)
 11. base::stop(...)

Second step: plotting the data on a scatterplot

todo: exchange price changehez nézni, nem az árhoz

btc_sent_plot
No trace type specified:
  Based on info supplied, a 'scatter' trace seems appropriate.
  Read more about this trace type -> https://plotly.com/r/reference/#scatter
No scatter mode specifed:
  Setting the mode to markers
  Read more about this attribute -> https://plotly.com/r/reference/#scatter-mode
No trace type specified:
  Based on info supplied, a 'scatter' trace seems appropriate.
  Read more about this trace type -> https://plotly.com/r/reference/#scatter
No scatter mode specifed:
  Setting the mode to markers
  Read more about this attribute -> https://plotly.com/r/reference/#scatter-mode

Third step: calculate covariance and correlation

btc_cov
[1] 21506.41

A positive covariance means that the two variables tend to increase or decrease together. Correlation helps us analyze the effect of changes made in one variable over the other variable of the dataset. Now that we know this, we should calculate the strength of the relationship between two, numerically measured, continuous variables.

btc_cor
[1] 0.1836311
LS0tDQp0aXRsZTogIlBsb3RzIGFuZCBncmFwaHMgZm9yIHRoZXNpcyINCm91dHB1dDogaHRtbF9ub3RlYm9vaw0KLS0tDQpgYGB7cn0NCmxpYnJhcnkodGlkeXZlcnNlKQ0KbGlicmFyeSh3cml0ZXhsKQ0KbGlicmFyeShwbHlyKQ0KbGlicmFyeShsdWJyaWRhdGUpDQpsaWJyYXJ5KHBsb3RseSkNCmBgYA0KDQoNCiMjIEV4Y2hhbmdlIHJhdGUgYW5kIHNlbnRpbWVudDogaXMgdGhlcmUgYSBjb25uZWN0aW9uPw0KQmVmb3JlIHdlIGNhbiBzdGFydCB3aXRoIG91ciBtYWNoaW5lIGxlYXJuaW5nIG1vZGVsIHdlIG5lZWQgdG8gdW5kZXJzdGFuZCB0aGUgcmVsYXRpb25zaGlwIGJldHdlZW4gdGhlIHR3byB2YXJpYWJsZXMsIHRoZXJlZm9yZSB3ZSBzaG91bGQgY2FsY3VsYXRlIGNvdmFyaWFuY2UuIFRoaXMgbWVhc3VyZXMgdGhlIGRpcmVjdGlvbiBvZiBhIHJlbGF0aW9uc2hpcCBiZXR3ZWVuIHRoZSB0d28gdmFyaWFibGVzLg0KDQojIEZpcnN0IHN0ZXA6IGNyZWF0aW5nIGEgZGF0YWZyYW1lIGZyb20gdGhlIGNzdg0KYGBge3J9DQpidGNfZXhjaGFuZ2VfcmF0ZV9oaXN0b3J5IDwtIHJlYWQuY3N2KCJEOi9TdWxpL1N6YWtkb2xnb3phdDEvZGV2ZWxvcG1lbnRfbl9zdHVmZi9hZ2dyZWdhdGVkX2RhdGEuY3N2IikgJT4lDQogICAgICAgICAgICAgICAgICAgICAgICAgICAgIHNlbGVjdCgtWCkgJT4lDQogICAgICAgICAgICAgICAgICAgICAgICAgICAgIG11dGF0ZShjcmVhdGVkX2F0ID0gYXNfZGF0ZShjcmVhdGVkX2F0KSkNCg0KDQoNCmJ0Y191c2RfdHdlZXRzX2NvbWJpbmVkIDwtIHJlYWQuY3N2KCJEOi9TdWxpL1N6YWtkb2xnb3phdDEvZGF0YV90b19iZV9jbGVhbmVkL2NvaW5fQml0Y29pbi5jc3YiKSAlPiUNCiAgICAgICAgICAgICAgICAgICAgICAgICAgIG11dGF0ZShEYXRlID0gYXNfZGF0ZXRpbWUoRGF0ZSkpICU+JQ0KICAgICAgICAgICAgICAgICAgICAgICAgICAgbXV0YXRlKHllYXIgPSB5ZWFyKERhdGUpLCBtb250aCA9IG1vbnRoKERhdGUpLCBkYXkgPSBkYXkoRGF0ZSkpICU+JQ0KICAgICAgICAgICAgICAgICAgICAgICAgICAgbXV0YXRlKERhdGUgPSBtYWtlX2RhdGUoeWVhcixtb250aCxkYXkpKSAlPiUgDQogICAgICAgICAgICAgICAgICAgICAgICAgICBmaWx0ZXIoRGF0ZSA+PSAiMjAxOS0wMS0wMSIgJiBEYXRlIDw9ICIyMDIyLTAzLTMwIikgJT4lDQogICAgICAgICAgICAgICAgICAgICAgICAgICBpbm5lcl9qb2luKGJ0Y19leGNoYW5nZV9yYXRlX2hpc3RvcnksIGJ5ID0gYygiRGF0ZSIgPSAiY3JlYXRlZF9hdCIpKSAlPiUgDQogICAgICAgICAgICAgICAgICAgICAgICAgICBzZWxlY3QoYygteWVhciwtbW9udGgsLWRheSkpDQoNCmBgYA0KDQpgYGB7cn0NCmJ0Y19zZW50X2xpbmVwbG90IDwtIHBsb3RfbHkoZGF0YSA9IGJ0Y191c2RfdHdlZXRzX2NvbWJpbmVkLCB4ID0gfkRhdGUsIHkgPSB+Q2xvc2UsIG5hbWUgPSdleGNoYW5nZSByYXRlJywgdHlwZSA9ICdzY2F0dGVyJywgbW9kZSA9ICdsaW5lcycpDQoNCmJ0Y19zZW50X2xpbmVwbG90IDwtIGJ0Y19zZW50X2xpbmVwbG90ICU+JSBhZGRfdHJhY2UoeSA9IH5kYWlseV9hdmdfc2VudCwgbmFtZSA9ICdhdmcuIHNlbnRpbWVudCcsIG1vZGUgPSAnbGluZXMnKSANCg0KYnRjX3NlbnRfbGluZXBsb3QNCmBgYA0KDQojIFNlY29uZCBzdGVwOiBwbG90dGluZyB0aGUgZGF0YSBvbiBhIHNjYXR0ZXJwbG90DQp0b2RvOiBleGNoYW5nZSBwcmljZSBjaGFuZ2VoZXogbsOpem5pLCBuZW0gYXogw6FyaG96DQoNCg0KYGBge3J9DQpidGNfc2VudF9zY2F0dGVycGxvdCA8LSBwbG90X2x5KGRhdGEgPSBidGNfdXNkX3R3ZWV0c19jb21iaW5lZCwgeSA9ICB+ZGFpbHlfYXZnX3NlbnQsIHggPSB+Q2xvc2UsDQogICAgICAgICAgICAgICAgICAgICAgICAgICAgbWFya2VyID0gbGlzdChzaXplID0gNCwNCiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgY29sb3IgPSAncmdiYSgyNTUsIDE4MiwgMTkzLCAuOSknLA0KICAgICAgICAgICAgICAgICAgICAgICAgICAgICBsaW5lID0gbGlzdChjb2xvciA9ICdyZ2JhKDE1MiwgMCwgMCwgLjgpJywgd2lkdGggPSAxKSkpDQoNCmJ0Y19zZW50X3NjYXR0ZXJwbG90DQpgYGANCg0KIyBUaGlyZCBzdGVwOiBjYWxjdWxhdGUgY292YXJpYW5jZSBhbmQgY29ycmVsYXRpb24NCg0KYGBge3J9DQpidGNfY292IDwtIGNvdihidGNfdXNkX3R3ZWV0c19jb21iaW5lZCRkYWlseV9hdmdfc2VudCxidGNfdXNkX3R3ZWV0c19jb21iaW5lZCRDbG9zZSwgbWV0aG9kID0gInNwZWFybWFuIikNCg0KYnRjX2Nvdg0KDQpgYGANCkEgcG9zaXRpdmUgY292YXJpYW5jZSBtZWFucyB0aGF0IHRoZSB0d28gdmFyaWFibGVzIHRlbmQgdG8gaW5jcmVhc2Ugb3IgZGVjcmVhc2UgdG9nZXRoZXIuIA0KQ29ycmVsYXRpb24gaGVscHMgdXMgYW5hbHl6ZSB0aGUgZWZmZWN0IG9mIGNoYW5nZXMgbWFkZSBpbiBvbmUgdmFyaWFibGUgb3ZlciB0aGUgb3RoZXIgdmFyaWFibGUgb2YgdGhlIGRhdGFzZXQuDQpOb3cgdGhhdCB3ZSBrbm93IHRoaXMsIHdlIHNob3VsZCBjYWxjdWxhdGUgdGhlIHN0cmVuZ3RoIG9mIHRoZSByZWxhdGlvbnNoaXAgYmV0d2VlbiB0d28sIG51bWVyaWNhbGx5IG1lYXN1cmVkLCBjb250aW51b3VzIHZhcmlhYmxlcy4NCg0KYGBge3J9DQpidGNfY29yIDwtIGNvcihidGNfdXNkX3R3ZWV0c19jb21iaW5lZCRkYWlseV9hdmdfc2VudCxidGNfdXNkX3R3ZWV0c19jb21iaW5lZCRDbG9zZSwgbWV0aG9kPSJzcGVhcm1hbiIpDQoNCmJ0Y19jb3INCmBgYA0KDQoNCg==